Wenguan Wang

Uncertainty-Aware Gaussian Map for Vision-Language Navigation

May 26, 2026

3D Gaussian Map with Open-Set Semantic Grouping for Vision-Language Navigation

May 26, 2026

Clinically-Grounded Counterfactual Reasoning for Medical Video Diagnosis

May 26, 2026

AxiomOcean: Forecasting the Three-Dimensional Structure of the Upper Ocean

May 11, 2026

Learning 3D Representations for Spatial Intelligence from Unposed Multi-View Images

Apr 12, 2026

SinkTrack: Attention Sink based Context Anchoring for Large Language Models

Apr 11, 2026

PKINet-v2: Towards Powerful and Efficient Poly-Kernel Remote Sensing Object Detection

Mar 17, 2026

Iris: Bringing Real-World Priors into Diffusion Model for Monocular Depth Estimation

Mar 17, 2026

History-Enhanced Two-Stage Transformer for Aerial Vision-and-Language Navigation

Dec 17, 2025

Moving Beyond Diffusion: Hierarchy-to-Hierarchy Autoregression for fMRI-to-Image Reconstruction

Oct 25, 2025